Optimization of Floating-Point based Finite Difference Methods on FPGAs
ثبت نشده
چکیده
Finite Difference (FD) methods are the most prevalent numerical modelling algorithms for evaluating initial or boundary value problems in scientific and engineering applications. Unfortunately, simulating time evolutions for transient physical phenomenon is computationally demanding and data-intensive. In this paper, we introduce an efficient implementation of FD computing engine on FPGA-based Reconfigurable Computing (RC) platform. Instead of following the formal high-order FD expressions with standard IEEE-754 compliant floatingpoint arithmetic units, we propose a new class of optimized finite-accurate FD schemes, whose FD coefficients are optimized to be represented with only a few binary bits without deteriorating numerical accuracy criterions. Furthermore, in order to simplify the implementation of subsequent floating-point summations, we replace the conventional costly floating-point adder tree by a floating-point/fixed-point hybrid accumulator based on group-alignment technology. The resulting fullypipelined FD computing engine with finite accurate coefficients can provide us similar or even better worst case relative and absolute rounding errors than standard floating-point arithmetic, but consumes only a fraction of hardware resources. Moreover, the regular layout and localized interconnections of our design lead to higher data throughput and much less pipeline stages especially for FD schemes with wide stencils.
منابع مشابه
Perspectives for the Use of Field Programmable Gate Arrays for Finite Element Computations
We have studied how the solution of partial differential equations by means of finite element methods could be accelerated using Field Programmable Gate Arrays (FPGAs). First, we discuss in general the capabilities of current FPGA technology for floating-point implementations of number crunching. Based on practical results for basic floating-point operators performance limits are outlined. Then...
متن کاملAn Independent Analysis of Floating-point DSP Design Flow and Performance on Altera 28-nm FPGAs
OVERVIEW FPGAs are increasingly used as parallel processing engines for demanding digital signal processing applications. Benchmark results show that on highly parallelizable workloads, FPGAs can achieve higher performance and superior cost/performance compared to digital signal processors (DSPs) and general-purpose CPUs. However, to date, FPGAs have been used almost exclusively for fixed-point...
متن کاملFloating-point DSP Design Flow and Performance on Altera 28-nm FPGAs
OVERVIEW FPGAs are increasingly used as parallel processing engines for demanding digital signal processing applications. Benchmark results show that on highly parallelizable workloads, FPGAs can achieve higher performance and superior cost/performance compared to digital signal processors (DSPs) and general-purpose CPUs. However, to date, FPGAs have been used almost exclusively for fixed-point...
متن کاملHybrid FPGA : architecture and interface
Hybrid FPGAs (Field Programmable Gate Arrays) are composed of general-purpose logic resources with different granularities, together with domain-specific coarse-grained units. This thesis proposes a novel hybrid FPGA architecture with embedded coarse-grained Floating Point Units (FPUs) to improve the floating point capability of FPGAs. Based on the proposed hybrid FPGA architecture, we examine ...
متن کاملAchieving One TeraFLOPS with 28-nm FPGAs
Due to recent technological developments, high-performance floating-point signal processing can, for the first time, be easily achieved using FPGAs. To date, virtually all FPGA-based signal processing has been implemented using fixed-point operations. This white paper describes how floating-point technology on FPGAs is not only practical now, but that processing rates of one trillion floating-p...
متن کامل